Resolving Ambiguous Entity through Context Knowledge and Fuzzy Approach
Abstract
Entity extraction is considered a fundamental step in many text mining applications such as machine translation, text summarization and text categorization. However, the major challenge in extracting an entity from a sentence is ambiguity, specifically lexical ambiguity. While a human can resolve the intended meaning easily from his or her own knowledge, it is very difficult for a machine to do so. This paper proposes a new technique for resolving the ambiguity problem through a fuzzy approach and context knowledge. The technique integrates subject and lexical knowledge, possibility theory, and fuzzy sets into natural language processing. Lexical knowledge was obtained from WordNet, while subject and lexical knowledge were deployed as context knowledge. Possibility theory and fuzzy sets were applied to select the most possible meaning of an ambiguous entity based on the context. The work was restricted to the noun part of speech. The technique was implemented and tested on 1110 sentences, with precision and recall as the evaluation metrics. The technique achieved a precision of 85.7% and a recall of 80.3%. The results indicate that the proposed technique is successful.
Keywords: natural language processing; ambiguity; context knowledge; fuzzy approach; information extraction
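The abstract does not give the scoring details, so the following is only a minimal sketch of how a possibility-based choice between noun senses could look, not the authors' implementation. The membership table, the sense labels, and the function names `sense_possibility` and `resolve` are all hypothetical, and the use of the maximum membership degree over the context words as the possibility of a sense is an assumption made for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical fuzzy membership degrees in [0, 1]: how compatible each
# candidate sense of a noun is with a given context word. In a real system
# these would be derived from lexical and subject knowledge (e.g. WordNet);
# the numbers here are invented for illustration only.
SENSE_CONTEXT_MEMBERSHIP: Dict[str, Dict[str, Dict[str, float]]] = {
    "bank": {
        "financial_institution": {"money": 0.9, "loan": 0.8, "river": 0.1},
        "river_side": {"money": 0.1, "water": 0.9, "river": 1.0},
    }
}


def sense_possibility(memberships: Dict[str, float], context: List[str]) -> float:
    """Assumed scoring rule: the possibility of a sense is the highest
    membership degree reached by any word in the context (0.0 if none)."""
    return max((memberships.get(word, 0.0) for word in context), default=0.0)


def resolve(noun: str, context: List[str]) -> Tuple[str, Dict[str, float]]:
    """Return the sense with the highest possibility, plus all scores."""
    senses = SENSE_CONTEXT_MEMBERSHIP[noun]
    scores = {
        sense: sense_possibility(memberships, context)
        for sense, memberships in senses.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


if __name__ == "__main__":
    print(resolve("bank", ["river", "water"]))
    print(resolve("bank", ["loan", "money"]))
```

With this toy table, a context mentioning "river" or "water" favours the riverside sense, while "loan" or "money" favours the financial sense. Other aggregation rules (for example a minimum or a weighted combination over context words) would fit the same structure.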
Similar resources
Lexical Disambiguation in Natural Language Questions (NLQs)
Question processing is a fundamental step in a question answering (QA) application, and its quality affects the performance of the QA application. The major challenge in processing a question is how to extract the semantics of natural language questions (NLQs). Human language is ambiguous. Ambiguity may occur at two levels: lexical and syntactic. In this paper, we propose a new approach for reso...
Resolving Ambiguous Preposition Phrase Using Genetic Algorithm
Text mining refers to the process of discovering interesting and nontrivial patterns or knowledge embedded in unstructured text documents from a fixed domain. It is also known as knowledge discovery from text databases. Text mining tasks include text categorization, text clustering, concept/entity extraction, document summarization and entity relation modelling. Extracting concept/fact from th...
Algorithms to Resolve Conflict in Multiuser Context Aware Ubiquitous Environment
Conflict resolution in context-aware computing is receiving more attention from researchers as pervasive/ubiquitous computing environments take into account multiple users and multiple applications. In multi-user ubiquitous computing environments, conflicts among users' contexts need to be detected and resolved. Conflicts arise when multiple users try to access or try to have a contro...
An Executive Approach Based On the Production of Fuzzy Ontology Using the Semantic Web Rule Language Method (SWRL)
Today, the need to deal with ambiguous information in semantic web languages is increasing. Ontology is an important part of the W3C standards for the semantic web, used to define a conceptual standard vocabulary for the exchange of data between systems, the provision of reusable databases, and the facilitation of collaboration across multiple systems. However, classical ontology is not enough ...
Communicating with Cost-based Implicature: a Game-Theoretic Approach to Ambiguity
A game-theoretic approach to linguistic communication predicts that speakers can meaningfully use ambiguous forms in a discourse context in which only one of several available referents has a costly unambiguous form and in which rational interlocutors share knowledge of production costs. If a speaker produces a low-cost ambiguous form to avoid using the high-cost unambiguous form, a rational li...